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ISOLATED NUCLEIC ACID MOLECULES WHICH ENCODE T CELL INDUCIBLE 
FACTORS (TIFs), THE PROTEINS ENCODED, AND USES THEREOF 



FIELD OF THE INVENTION 

This invention relates to newly isolated nucleic acid molecules and their uses. The 
nucleic acid molecules are shown to be upregulated by the cytokine interleukin-9 ("IL-9"). 
Also disclosed are the proteins encoded thereby. They are described as T Cell Derived 
Inducible Factors ( " TIFs " ) . Thes e nucleic acid molecules encode proteins which induce STAT 
activation in cells. They can be used, for example, in the stimulation of regeneration of 
targeted tissues . Further, their inhibitors or antagonists can be used to retard, prevent or 
inhibit differentiation of other tissues. 

BACKGROUND AND PRIOR ART 

The last decade has seen knowledge of the immune system and its regulation expand 
tremendously. One area of particular interest has been that of research on the proteins and 
glycoproteins which regulate the immune system. One of the best known families of these 
molecules are the cytokines. These are molecules which are involved in the "communication" 
of cells with each other. The individual members of the cytokine family have been found to 
be involved in a wide variety of pathological conditions, such as cancer and allergies. 
Whereas sometimes the cytokines are involved in the pathology of the condition, they are also 
known as being therapeutically useful. 

Interleukins are one type of cytokine. The literature on interleukins is vast. An 
exemplary, but by no means exhaustive listing of the patents in this area includes U.S. Patent 
No. 4,778,879 to Mertelsmann et al.; U.S. Patent No. 4,490,289 to Stern; U.S. Patent No. 
4,518,584 to Mark et al.; and U.S. Patent No. 4,851,512 to Miyaji et al. , all of which involve 
interleukin-2 or "IL-2. " Additional patents have issued which relate to interleukin-1 ("IL-1 "), 
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such as U.S. Patent No. 4,808,611 to Cosman. The disclosure of all of these patents are 
incorporated by reference herein. More recent patents on different interleukins include U.S. 
Patent Nos. 5,694,234 (IL-13); 5,650,492 (IL-12); 5,700,664, 5,371,193 and 5,215,895 (IL- 
11); 5,728,377, 5,710,251, 5,328,989 (IL-10); 5,580,753, 5,587,302,5,157,112, 5,208,218 
(IL-9); 5,194,375, 4,965,195 (IL-7); 5,723,120, 5,178,856 (IL-6), and 5,017,691 (IL-4). 
Even a cursory review of this patent literature shows the diversity of the properties of the 
members of the interleukin family. One can assume that the larger cytokine family shows 
even more diversity. See, e.g., Aggarwal etal., ed., Human Cytokines: Handbook For Basic 
And Clinical Research (Blackwell Scientific Publications, 1992), Paul, ed., Fundamental 
Immunology (Raven Press, 1993), pg 763-836, "T-Cell Derived Cytokines And Their 
Receptors", and "Proinflammatory Cytokines and Immunity." All cited references are 
incorporated by reference. 

The relationships between various cytokines are complex. As will be seen from the 
references cited herein, as the level of a particular cytokine increases or decreases, this can 
affect the levels of other molecules produced by a subject, either directly or indirectly. 
Among the affected molecules are other cytokines. 

The lymphokine IL-9, previously referred to as "P40," is a T-cell derived molecule 
which was originally identified as a factor which sustained permanent antigen independent 
growth of T4 cell lines. See, e.g., Uyttenhove et al., Proc. Natl. Acad. Sci. 85: 6934 (1988), 
and Van Snick et al. , J . Exp. Med. 169: 363 (1989), the disclosures of which are incorporated 
by reference, as is that of Simpson et al., Eur. J. Biochem. 183: 715 (1989). 

The activity of IL-9 was at first observed on restricted T4 cell lines, failing to show 
activity on CTLs or freshly isolated T cells. See, e.g., Uvttenhove et al ., supra , and Schmitt 
et al., Eur. J. Immunol. 19: 2167 (1989). This range of activity was expanded when 
experiments showed that EL-9 and the molecule referred to as T cell growth Factor III ( "TCGF 
III") are identical to MEA (Mast Cell Growth Enhancing Activity), a factor which potentiates 
the proliferative response of bone marrow derived mast cells to IL-3, as is described by 
Hultner et al., Eur. J. Immunol, and in U.S. patent application Serial Number 498,182 filed 
March 23, 1990, the disclosures of both being incorporated by reference herein. It was also 
found that the human form of IL-9 stimulates proliferation of megakaryoblastic leukemia. See 
Yang et al., Blood 74: 1880 (1989). Recent work on IL-9 has shown that it also supports 
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erythroid colony formation (Donahue et al., Blood 75(12): 2271-2275 (6-15-90)); promotes 
the proliferation of myeloid erythroid burst formation (Williams et al. , Blood 76: 306-3 1 1 (9- 
1-90); and supports clonal maturation of BFU-E's of adult and fetal origin (Holbrook et al., 
Blood 77(10): 2129-2134 (5-15-91)). Expression of IL-9 has also been implicated in 
5 Hodgkins's disease and large cell anaplastic lymphoma (Merz et al., Blood 78(8): 1311-1317 

(9-1-90). Genetic analyses of mice that were susceptible or resistant to the development of 
bronchial hyperresponsiveness have unraveled a linkage with the IL-9 gene as well as a 
correlation between IL-9 production and susceptibility in this model ( Nicolaides etal.. Proc. 
Natl. Acad. Sci. USA, 94, 13175-13180, 1997). Human genetic studies also point to the IL-9 

10 and IL-9R genes as candidates for asthma ( Doull etal.. Am. J. Respir. Crit. Care Med., 153 . 

1280-1284, 1996; Holrovd et al. . Genomics 52, 233-235, 1998). Secondly, IL-9 transgenic 
mice allowed for the demonstration that increased IL-9 expression result in lung mastocytosis, 
hypereosinophilia, bronchial hyperresponsiveness and high levels of IgE ( Temann et al.. J. 
Exp. Med. 188, 1307-1320, 1998; Godfraind et al. . J. Immunol. 160, 3989-3996, 1998; 

15 McLane et al. . Am. J. Resp. Cell. Mol. 19:713-720 (1999). Taken together, these 

observations strongly suggest that IL-9 plays a major role in this disease Additional work has 
implicated IL-9 and muteins of this cytokine in asthma and allergies. See, e.g. PCT 
Application US96/12757 (Levitt, et al), and PCT US97/21992 (Levitt, et al), both of which 
are incorporated by reference.. 

20 IL-9 is known to affect the levels of other molecules in subjects. See Louahed et al. . 

J. Immunol. 154: 5061-5070 (1995; Demoulin et al. . Mol. Cell. Biol. 16: 4710-4716 (1996), 
both incorporated by reference. It will be recognized that the molecules affected have their 
own functions in biological systems. For example, Demoulin et al. show that many of the 
known activities of IL-9 are mediated by activation of STAT transcription factors. As such, 

25 there is continued interest in trying to identify molecules whose presence and/or level is 

affected by other molecules, such as cytokines. 

The disclosure which follows describes such molecules. It was found that nucleic acid 
molecules encoding the proteins of the invention were expressed in the presence of DL-9, but 
not in its absence. Hence, these molecules are, inter alia, "markers" for the expression or 

30 effect of IL-9 in a subject. The molecules are referred to as T Cell Derived Inducible Factors 
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or "TIFs" hereafter. These and other features of the invention will be seen in the disclosure 
which follows. 



BRIEF DESCRIPTION OF THE FIGURE 

Figure 1 compares deduced amino acid sequences of murine and human TIF (SEQ ED 
5 NOS: 27 and 28, respectively). 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Example 1 

The murine lymphoma cell line BW5147 is well known as a cell line which can he 
grown in vitro , without the need to add any cytokines to its culture medium. In order to 

10 identify genes induced by IL-9, samples of BW5 147 were cultured either with (200 U/ml), or 

without IL-9, for 24 hours. Then, total RNA was isolated, using guanidium isothiocyanate 
lysis, and CsCl gradient centrifugation. These techniques are well known in the art. 
Following this, polyadenylated RNA was purified from the total RNA, by using an oligo(dT) 
cellulose column. The isolated, polyA RNA was then used to generate double stranded 

15 cDNA. A commercially available oligo(dT) primer was used. Anywhere from 3-5 ug of 

polyA RNA were heated to 70°C for 10 minutes with l^g of oligo dT, and then incubated 
with 5x first strand buffer (250 mM HC1 (pH 8.3), 375 mM KC1, 15 mM MgCy, 10 mM 
dithiothreitol, 500 uM of deoxynucleotide triphosphates, and 800 U of reverse transcriptase. 
Total volume of the reaction mixture was 20 ul, and the reaction was allowed to proceed at 

20 37 °C for one hour. This resulted in synthesis of the first stand of cDNA. Second strand 

synthesis was accomplished by adding 30ul of 5 second strand buffer (lOOmM Tris-HCl (pH 
6.9)), 450mM KC1, 23mM MgCl 2 , 0.75mM p-NAD + , 50mM (NH 4 ) 2 S0 4 , together with 60U 
of E. coli derived DNA polymerase I, 2U of E. coli RNase H, 10 U of E. coli DNA ligase, 
and 250 uM of deoxynucleotide triphosphates, and brought to a final volume of 150 ul. The 

25 mixture was incubated for two hours, at 16°C. 

The product was extracted using phenol-chloroform, and was precipitated with ethanol. 
The final cDNA product was then resuspended in 200 iA of TE. 

These steps were carried out for both the stimulated BW5 147 cells (" tester" hereafter) , 
and for parallel, unstimulated BW5147 cells ("driver" hereafter). 

4 
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Example 2 

The cDNA prepared in Example 1 was then subjected to subtraction cloning in 
accordance with well known methods. To do this, six oligonucleotides were prepared: 
5'-AGCACTCTCC AGCCTCTCAC CGCA-3 (SEQ ID NO: 1); 

5 5 ' -GATCTGCGGT GA-3' (SEQ ID NO: 2); 

5'-ACCGACGTCG ACTATCCATG AACA-3* (SEQ ID NO: 3); 

5 1 -G ATCTGTTC A TG-3' (SEQ ID NO: 4); 

5 ' -AGGCA ACTGT GCTATCCGAG GGAA-3' (SEQ ID NO: 5); and 

10 

5'-GATCTTCCCT CG-3' (SEQ ID NO: 6). 



These were used as explained herein. Double stranded cDNA (2 ug), was digested with 
restriction endonuclease DpnII, extracted with phenol-chloroform, precipitated with ethanol, 
and resuspended in 20 ul of TE (lOmM Tris-HCl (pH 7.5); ImM EDTA). Twelve ul (1.2 
ug), of cut cDNA was ligated to double stranded SEQ ID NOS: 1 and 2, in a mixture which 
included 4 ul of desalted SEQ ID NO: 1 (2mg/ml), 4 ul desalted SEQ ID NO: 2 (1 mg/ml), 
10pl of 5X adapter buffer (330mM Tris-HCl, pH 7.6, 50mM MgCl 2 , 5mM ATP), 7 M 1 DTT 
(lOOmM), and 28^1 of H 2 0). The oligonucleotides were annealed to each other and to the 
sample DNA by heating the mixture to 50°C and then cooling it to 10°C over one hour, 
followed by adding 5ul of T4 DNA ligase, and incubation for 12-14 hours, at 12-16°C. The 
mixtures were diluted by adding 140 ul of TE. PCR was then carried out on 200 ul samples, 
as described infra. 



Example 3 

To carry out PCR, 200 ul samples containing 2 ul of the ligation product in a buffer 
of 66 mM Tris-HCl, pH 8.8, 4 mM MgCl 2 , 16 mM (NH 4 ) 2 S0 4 , 33 ug/ml BSA, 0.3 mM of 
each dNTP (concentration: 500 jiM), and 2 ug of SEQ ID NO: 2 were first heated at 72°C 
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for three minutes to remove any of SEQ ID NO: 1 which was hybridized to the product of 
Example 2. The 3' ends were then filled in by using 5U of Taq polymerase (5 minutes, . 
72°C). Twenty cycles of amplification were carried out (1 cycle: 1 minute at 95°C, and three 
minutes at 72°C), after which products were combined, phenol extracted, ethanol precipitated, 
and resuspended in TE buffer, at a concentration of 0.5 ug/ul. Hereinafter, this is referred 
to as the representation. 

Example 4 

The representation was then prepared for subtractive hybridization by removing SEQ 
ID NO: 1 therefrom by digestion with Dpn II. The resulting digest was phenol extracted and 
ethanol precipitated. In the case of the unstimulated sample, this resulted in the driver, while 
the stimulated sample resulted in the tester. Portions of tester (20 ug) were gel purified on 
a 1.2% agarose gel and isolated. Samples (2 ug), were ligated to SEQ ID NOS: 3 and 4, in 
the same way that SEQ ID NOS: 1 and 2 were ligated, as described, supra. 

In a first cycle of subtractive hybridization, 0.4 ug samples of tester with SEQ ID 
NOS: 3 and 4 ligated thereto were mixed with 40 ug of driver cDNA. The mixture was 
phenol extracted, ethanol precipitated, dissolved in 2 ul of 3XEE buffer (30mM EPPS pH 
8.0), 3mM EDTA; pH 8.0, 3 mM EDTA. This was overlaid with 30 ul of mineral oil, and 
denatured for five minutes at 98 °C. A 5M NaCl solution (0.5 ul) was added, and DNA was 
hybridized for 20 hours, at 67 °C. The reaction mixture was diluted to 200 ul with TE, and 
tRNA carrier. The samples were incubated for three minutes at 72 °C to melt away SEQ ID 
NO: 4, and then four PCR reactions (200 ul) were prepared. These included 20 ul of diluted 
hybridization mix without primer, to fill in the ends of the reannealed tester, followed by 10 
cycles of amplification after adding samples of SEQ ID NO: 3(1 cycle: Iminute at 95°C, three 
minutes at 70 °C) after which products were combined, phenol extracted, ethanol precipitated, 
and resuspended in 40 pi of 0.2XTE buffer. Single stranded DNA was degraded by a 30 
minute treatment of 20 /il of this material with 20U of mung bean nuclease, at a total volume 
of 40 ul. Samples was diluted (1:5), in 50 mM Tris-HCl, at pH 8.9, followed by five minutes 
of heating at 98°C to inactivate the enzyme. A second PCR was carried out, using 20 ul of 
the product described supra . 2 ul of SEQ ID NO: 3 (1 mg/ml), and 1 ul (5 U) of Taq DNA 
polymerase. A total of 18 cycles (1 cycle: 1 minute at 95 °C, three minutes at 70°C) were 
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carried out. Products were combined, phenol extracted, ethanol precipitated, and resuspended 
at 0.5-1 ug//xl. The product is referred to hereafter as "DPI or the first difference product. 

Example 5 

DPI was then digested with endonuclease DpnII, as described above, and was ligated 
to SEQ ID NOS: 5 and 6, following the same processes described for SEQ ID NOS: 1, 2, 3 
and 4. Subtractive hybridization and selective amplification, as described in example 4, was 
repeated, and second difference product, or "DP2", was generated. In these experiments, 50 
ng of DPI was the tester. The driver (40 ug), was as described supra . The process was 
repeated to generate a third difference product, using SEQ ID NOS: 3 and 4 as adapters. To 
generate the third product, 100 pg of tester were mixed with 40 fig of driver. All steps of the 
protocols supra were repeated, except the final amplification was carried out for 22 cycles, 
where one cycle was one minute at 95 °C, and three minutes at 70°C. This yielded the final 
difference product. 

Example 6 

The final difference products were digested with DpnII, and then cloned into the 
BamHI site of a commercially available vector, i.e. , ptZ19R. Double stranded DNA plasmids 
were prepared, and then sequenced, using standard methods. The sequences were compared 
to known sequences in the GenBank and EMBL data bases, using a BLAST search program. 

At the end of this subtraction procedure, a short cDNA fragment was identified, i.e., 
a fragment about 200 base pairs long. This fragment was used to screen a cDNA library from 
BW 5147 cells. The largest clone was sequenced. It is discussed infra. It does not 
correspond to any known sequence. 

The nucelotide sequence (SEQ ID NO: 7), is 1121 bases long, including a 537 base 
pair open reading frame, which encodes a protein 179 amino acids long. The predicted 
molecular weight of the protein is 20,093. There are two additional ATG codons which, if 
they acted as start codons, would produce proteins 172 and 167 amino acids in length, with 
molecular weights of 19,335 and 18,770 daltons, respectively. Each form of the protein is 
characterized by a sequence of hydrophobic amino acids which would be cleaved off of the 
molecule via the endoplasmic reticulum to provide a mature protein. 
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Analysis of the sequence shows three AT rich motifs (TTATTTAT). These motifs are 
often found in 5' - untranslated regions of cytokines and oncogenes. Kravs. et al. . Science 
245 : 852 (1989), have shown that these repeats modulate stability of mRNA for TIF. 

Example 7 

5 The cDNA isolated and analyzed in example 6, supra , was then used as a probe to 

identify genomic DNA for TIFa. 

A genomic library prepared from mouse strain 129 was screened with SEQ ID NO: 7, 
following standard methods. An EcoRI fragment from a positive clone was subcloned into 
plasmid pZERO and partially sequenced. The partial sequence is presented as SEQ ID NO: 

10 8. 

Example 8 

A second EcoRI fragment from the positive clone described in Example 7, supra , was 
also subcloned. There was a great deal of homology, but the sequences were not identical. 
To be specific, intron 1 of this sequence was 98% identical to SEQ ID NO: 8, intron 2 was 

15 100% identical and intron 3 was 92% identical. 

What is striking about the sequences is that the promoters are not at all homologous, 
suggesting independent regulation. The 5' untranslated regions are 92% identical. The first 
exon for TIFa is split into exon la and exon ip. The first coding exon (which is exon lb for 
TIFa and exon 1 for TIFP) are 99.5% identical, while the second exons are 100% identical, 

20 the third exons 97% identical, the fourth exons 98.5% identical, and 96% for the fifth exon. 

In the untranslated 3' - region, homology is 96%. 



Example 9 

Using the information described in example 8, supra , a cDNA sequence for the second 
clone, designated TIFP was deduced, and is set forth as SEQ ID NO: 9. The genomic DNA 
25 sequence was also ascertained, in the same manner as is described, supra , and is set forth as 

SEQ ID NO: 29. 

As compared to the coding region for TIFa, that of TIFp has six silent changes. There 
are two changes which result in an inconsequential amino acid change (at both of positions 36 

8 
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and 113, Val in TIFa becomes He in TIFP). There is also a more significant change, at 
position 1 12, where Gin becomes Arg. 

Example 10 

Experiments were undertaken to study expression of the TIFs. BW 5147 cells were 
stimulated with recombinant murine IL-9 (200U/ml), for varying periods of time (0.2, 0.5, 
1, 2 & 24 hours). Total RNA was then isolated, using standard methods and reagents. 
Reverse transcription was then carried out, using 5 fig total RNA and an oligo (dT) primer. 
Samples of cDNA corresponding to 20ng of total RNA were then amplified for 25 cycles 
using different primers. (One cycle was 4 minutes at 94 °C, 1 minute at 57°C, and 2 minutes 
at 72 °C). The TIF primers were: 

S'-CTGCCTGCTT CTCATTGCCC T-3' (SEQ ID NO: 10) 
and 

5-CAAGTCTACC TCTGGTCTCA T-3 f (SEQ ID NO: 1 1) 
(sense and antisense, respectively). 

These correspond to nucleotides 106-126, and 764-784 of SEQ ID NO: 7, respectively. 
As a control, p-actin was amplified as well, for 18 cycles (first cycle: 4 minutes at 94°C, 1 
minute at 60°C, 2 minutes at 72°C. Succeeding cycles were 1 minute at 94°C, 1 minute at 
60°C, 2 minutes at 72°C). 

Following amplification, post PCR products were analyzed on a 1 % agarose gel, and 
specific amplification was confirmed, following blotting, using internal radioactive probes. 
The probe for TIF was: 

5 ' -GACGC AAGC A TTTCTCAG AG-3 1 (SEQ ID NO: 12) 

the conditions and probes set forth were not specific for one or the other of the forms of TIF; 
however, the amplification product of TIFa contains a Kpnl restriction site, while the 
restriction site for TIFp does not. Digestion of the amplification products with Kpnl indicated 
that most, if not all, of the TIF mRNA induced by IL-9 was TIFa, suggesting that the TIFa 
expression was induced rapidly via the IL-9. The mRNA for TIFa was detectable after 30 
minutes of stimulation, and reached a plateau over a 1-24 hour time period. 

Example 11 
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Experiments were then carried out which showed that the induction of TIF mRNA by 
IL-9, described supra , does not require protein synthesis. In these experiments, total RNA 
was extracted from cells stimulated for 24 hours, as described in example 10, but with or 
without l(Vg/ml of a protein synthesis inhibitor, cycloheximide, for 4.5 hours. In a parallel 
set of experiments, cells were not stimulated. The total RNA was extracted, and RT-PCR 
amplification was carried out as described in example 10. Post-PCR products were analyzed 
on an ethidium bromide-stained, 1 % agarose gel. What was seen was that the induction by 
IL-9 still occurred when protein synthesis was blocked. Hence, the effect of IL-9 is a direct 
effect, not requiring the synthesis of a protein mediator. 
Example 12 

In these experiments, the role of STAT proteins in induction of TIF mRNA was 
studied on derivatives of the cell line BW5147. The first line, BWh9R, expresses wild type 
human IL-9 receptors. The line BW-Phell6 is a transfectant with a single mutation (at 
position 116), which renders the receptor unable to activate STAT transcription factors. Still 
another cell line, BW-mut6, has a mutation which renders the receptor unable to activate 
STATS, while retaining the ability to activate ST ATI and STAT3 . Finally , cell line BW-mut7 
has a single mutation which renders the IL-9 receptor unable to activate STAT1 and STAT3, 
but which retains the ability to activate STAT5. 

Cell stimulation, isolation of total RNA, reverse transcription and amplification of 
cDNA were all carried out as described in example 10 (Cells were stimulated for 24 hours. 
Both human and murine recombinant IL-9 were used). The PCR products were analyzed on 
an ethidium bromide stained, 1 % agarose gel, as describe supra . 

The analysis revealed that human IL-9 did not induce expression in BW-Phell6, 
suggesting that STAT transcription factors are implicated. It was found that IL-9 induced TIF 
expression in the BW-mut6 mutant, but not the mut7 variant, suggesting that STAT1 or 
STAT3 are involved, but not STATS. 

Example 13 

The expression of TIF mRNA in normal mouse spleen cells was then studied. 
Spleen cells from 10-12 week old Balb/c mice were cultured for 24 hours in control 
medium or the control medium supplemented with 20/*g/ml of LPS (which activates B 
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lymphocytes and macrophages), or ConA (which activates T cells), or ConA plus 1% of a 
blocking antiserum against murine IL-9, with P actin being used as a control. Purification of 
RNA, RT-PCR analysis were carried out as described supra . 

The data indicated that TIF is, at best, very weakly expressed in resting spleen cells, 
5 not induced by LPS, but strongly induced by ConA. Anti IL-9 antiserum did not affect 

induction by ConA, suggesting that its effect is not mediated by IL-9, or is mediated by other 
cytokines. 

When the ConA activated spleen cells were analyzed using sequences of RT-PCR 
products, it was found that these cells were expressing TIFa predominantly, or exclusively. 

10 Example 14 

Further experiments showed that TIF mRNA was expressed even in the absence of IL- 

9 induction. 

Spleen cells from 5 week old FVB mice were enriched for T cells, using a nylon wool 
column. Then, the cells were stimulated for 24 hours in medium supplemented with ConA 
15 (a T cell activator), or PMA (which activates PKC in most cells), either with or without IL-9. 

Total RNA was isolated using standard techniques, and then ten microgram samples 
were fractionated via electrophoresis on a 1.3% agarose gel containing 2.2M formaldehyde. 
The fractions were then transferred to a nitrocellulose membrane, labeled, and assayed in a 
hybridization assay following Van Snick, et al, J. Exp. Med. 169: 363 (1989), incorporated 

20 by reference. 

The results indicated that the induction of TIF by ConA was not modified, and that IL- 

9 did not induce TIF RNA in PMA activated spleen cells. 
Example 15 

The expression of TIF mRNA in various cell lines was tested. In these experiments, 
25 murine cell lines were stimulated for at least one day, with a particular cytokine. Specifically, 

9T7 is a T cell lymphoma, which responds to IL-2, IL-4 or IL-9. Cell lines TS3 and TS6 are 
derived from T helper cell clones, and proliferate in the presence of either IL-2 or IL-9. MC9 
and LI38 are mast cell lines, which proliferate in the presence of either IL-3 or IL-9. 
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Following stimulation, total RNA was prepared using standard guanidium 
isothiocyanate lyses, and CsCl gradient centrifugation. 

The 9T7 line was then analyzed by Northern blotting, as described in example 14, 
while the other lines were assayed using RT-PCR analysis, as described supra. 

It was found that IL-9 upregulated TIF expression in T helper cells and mast cells, 
while IL-2 and IL-3 did not. The 9T7 cell line, however, showed roughly the same level of 
expression, regardless of the cytokine, indicating that IL-9 is not mandatory for TIF 
expression. 

Example 16 

The expression of TIF mRNA in B cell lines was then studied. The cell lines A20, 
70Z/3, and BCL-1 are B cell leukemia cell lines which grow, in vitro , without cytokines. 
These cells were stimulated for 24 hours with IL-4 and IL-9 and total RNA was isolated, using 
standard methods. Expression was analyzed by RT-PCR which was carried out for 35 cycles, 
followed by blotting and hybridization, as described supra . 

The results indicated that TIF expression is detectable in B cells, but is weakly 
upregulated at best in the presence of IL-9 and IL-4. 

Example 17 

Experiments were then carried out to study expression of the inventive molecules in 
T helper cell lines. TS2 and TS1 are known T helper cell lines, derived from T helper cell 
clones, which proliferate in the presence of either IL-9 or IL-2 (TS2), and either IL-9 or IL-4 
(TS1). Specifically, TS1 or TS2 cells were grown in the presence of the listed cytokines for 
at least 10 days, after which RNA was extracted using known methods. Expression of the 
molecules was studied via RT-PCR (35 cycles), using the protocols described supra . In TS1 
cells both IL-4 and IL-9 induce TIF expression, but IL-2 does not do so in TS2 cells. 

Example 18 

Expression of TIF mRNA in various mouse organs were studied. Total RNA was 
prepared from liver, kidney, heart, brain, intestine, spleen, thymus, lung, muscle and bone 
marrow, using standard guanidium isothiocyanate methodologies and CsCl gradient 
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centrifugation. Forty cycles of RT-PCR were carried out, using the protocols described supra. 
Strongest expression was found in thymus tissue, while less intense signals were found in brain 
tissue, and weaker expression in the remaining tissues. 

Exam ple 19 

5 The following experiments describe production of TIFa in 293-EBNA ceils. 

Complementary DNA for TIFa was described supra . It was subcloned into a 
commercially available expression vector pCEP-4, in operable linkage with a CMV promoter. 
The resulting plasmids were transfected into 293-EBNA cells, using standard lipofectamine 
methods. Following transfection, the cells were incubated in a methionine free medium, 

10 supplemented with 35 S labeled methionine, for 24 hours. Supernatant was harvested, and run 

on an acrylamide gel, followed by electrophoresis. The gel was then dried and exposed to 
autoradiography for 1 day. A control was then run by transfecting cells with the same 
plasmid, in which the cDNA was cloned in the antisense direction. 

A heterogenous band of about 25-30 kilodaltons was found from the cells transfected 

15 with TIF in the sense direction. Any discrepancies between the predicted molecular weight, 

the actual molecular weight in the system, and the heterogeneity, can be attributed to 
glycosylation. In a series of parallel experiments, cDNA encoding human TIF was expressed 
in the same way as the murine cDNA was expressed. With the exception of the change of the 
cDNA, all experimental parameters were the same. 

20 Example 20 

Further experiments were carried out to study production of TIFa in COS cells. 
Specifically, TIFa cDNA was subcloned into the plasmid pEF-BOS.puro described by 
DeTTioulinetal.. supra , in operable linkage with the EF-la promoter. The plasmid cDNA was 
transfected into COS cells, using the same lipofectamine method described supra. The cells 

25 were incubated in methionine free medium, supplemented with methionine for 24 hours, 

after which supernatant was treated as described in example 20, supra. Again, a heterogenous 
band of 25— 30 kilodaltons was observed, as well as an 18 kilodalton band, which probably 
represents a non-glycosylated form of the molecule. 
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Example 21 

In these experiments, it was discovered that TIF induces STAT activation in mesangial, 
neuronal melanoma, and hepatoma cells. It is known that when cytokines activate STAT 
factors, the factors dimerize, move from cytoplasm to the nucleus, and bind to target 
sequences in promoters. The details of the experiments follow. 

Transfected 293-EBNA cells as described supra were used following incubation in 
normal medium for 48 hours, as were supernatant from the controls, also described supra. 
Samples ofa mouse kidney mesangial cell line, ("MES13" hereafter), aratpheochromocytoma 
cell line, ("PC12" hereafter), four different human melanomas (SK23, AUMA, NA-8mel and 
MULL), human heptaoma (HepG3) and rat hepatoma (H-4-II-K) were used. Cell samples 
(0.5xl0 6 ) were stimulated for 5-10 minutes in the presence of 1% of supernatant. Nuclear 
extracts were then prepared, in accordance with Demoulin etal., Mol. Cell. Biol. 16: 4710 
(1996), incorporated by reference. In brief, cells were washed with PBS and then resuspended 
in 1 ml of ice cold hypotonic buffer for 15 minutes. (Buffer was lOmM HEPES buffer, pH 
7.5, with lOmM KC1, ImM MgCl 2 , 5% glycerol, 0.5 mM EDTA, O.lmM EGTA, 0.5mM 
dithiothreitol, and ImM Pefabloc, ImM Na 3 V 4 , and 5mM NaF). Cells were then lysed by 
adding 65 pi of NP-40, followed by vortexing. Nuclei were pelleted, by vortexing for 30 
seconds at 14,000 rpm, followed by extraction in buffer supplemented with HEPES (20mM), 
glycerol (20%), and NaCl (420mM). Nuclear debris was removed by centrifuging for 2 
minutes. DNA binding activity was determined in accordance with Demoulin et al., supra, 
using a 32 P labeled double stranded oligonucleotide called "GRR," which contains the STAT 
binding site of the FcyRI gene promoter, i.e.: 
5 ' ATGTATTTCC CAGAA A-3 ' (SEQ ID NO: 13) 

and 

5 ' -CCTTTTCTGG GAAATAC-3' (SEQ ID NO: 14) 

corresponding to the upper and lower strands of the binding sites in the GRR probe. Briefly, 
5pl volume of nuclear extracts were incubated in binding buffer (12mM HEPES, pH 7.6, 
lOmM KC1, 0.5mM EDTA, 2.5% glycerol, O.lmg of poly(dl-dC) per ml) for 5 minutes. 
Radiolabeled GRR probe (lO'cpm; approximately 0.5ng) was added, and incubation was 
continued for 25 minutes before loading onto a non-denaturing polyacrylamide gel. 
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It was also noted that the complexes observed in MES13 cells, described supra , were 
partially overshifted by both anti-STAT5 and anti-STAT3 antibodies, showing that (i) the cells 
under examination were targets for TIF, and (ii) that STAT3 and STATS are major 
components of the complex activated by TIF. The difference in STAT profile, as compared 
to the profile in Example 12, supra , is attributable to the difference in cell source (human 
versus mouse). It was also observed that human TIF works on murine cells, and vice versa. 

Example 22 

This example details the isolation and cloning of a nucleic acid molecule which encodes 
human TIF. First, human peripheral blood mononuclear cells were prepared via standard 
density gradient centrifugation. Following this preparation, samples were cultured for 24 
hours, at 3 x 10 6 cells/ml, either with or without anti-CD3 monoclonal antibody (The antibody 
was the commercially available OKT3 mAb, used in the form of ascites fluid at 1/500 
dilution). This antibody was used because T cell derived cytokines are generally expressed 
only upon activation by e.g., CD3 specific antibodies. 

Total RNA was isolated from these cells, using standard guanidine-isothiocyanate / 
CsCl ultra-centrifiigation techniques. Following isolation, lOyug samples of the RNA were 
reverse transcribed using an oligo (dT)15 primer. 

Following preparation of cDNA, as outlined supra , samples which corresponded to 
lOOng of total RNA were amplified, via PCR, using the following primers: 
5' - AGCTGCTCAA CTTCACCCTG GA - 3' (SEQ ID NO: 15) 
5' - CCACTCTCTC CAAGCTTTTT CA - 3' (SEQ ID NO: 16) 

which are based upon a murine cDNA sequence, (i.e., SEQ ID NO: 7). The PCR conditions 
involved 30 cycles of amplification, with one cycle defined as 1 minute at 94°C, followed by 
1 minute at 42 °C, and then 2 minutes at 72°C. Amplification product was separated on an 
agarose gel, using standard methods, and then sequenced. The result indicated that fragments 
of the cDNA had been amplified. Hence, a second reaction was carried but, using the same 
materials except SEQ ID NO: 16 was replaced by SEQ ID NO: 17, i.e.: 
5' - CAAGTCTACC TCTGGTCTCA T - 3' 

This second PCR reaction was carried out for 25 cycles, with one cycle being defined as 1 
minute at 94°C, followed by 1 minute at 45°C, and then 2 minutes at 72°C. The 
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amplification product was subjected to the same steps as the first one. Again, fragments of 
cDNA were amplified. 

Example 23 

Following preparation of amplification product, the 5' end of cDNA was isolated by 
using standard, 5' - RACE techniques. In brief, first strand cDNA was prepared by using 
SEQ ID NO: 18 as a primer, i.e.: 
5' -TGGCCAGGAAGGGCACCACCT-3' 

This primer was based upon the sequence information obtained in accordance with example 
22. In brief, the 5' - RACE method was carried out by combining 1 pig of total RNA, 
prepared as described supra , 2.5 pmoles of SEQ ID NO: 18, reverse transcriptase , reverse 
transcriptase buffer, 2.5 pel of dNTP mix (10 mM), 2.5 fA of MgCI 2 (25mM), and 2.5 pel of 
dithiothreitol (0. 1 M). The reaction was carried out and, after completion, original RNA was 
removed via adding RnaseH, and Rnase TI. Any unincorporated dNTPs, as well as primer 
and proteins, were removed. The cDNA was tailed using terminal transferase, or "TdT." 
This enzyme creates a 3' -binding site for the abridged anchor primer, as described infra. 
Tailing was carried out by combining the purified, first strand cDNA, TdT, buffer (10 mM 
Tris-HCl, 25 mM KC1, 1.5 mM MgCl 2 ), and 200 /^M of dCTP. 

Following the tailing reaction, PCR was carried out using 
5' - TGGCCAGGAA GGGCACCACC T - 3' (SEQ ID NO: 19), 
and 5' - RACE abridged anchor primer: 

5' - GGCCACGCGT CGACTAGTAC GGGIIGGGIIGGGIIG - 3* (SEQ ID NO: 20). 
The amplification involved 35 cycles (1 cycle defined as 1 minute at 94°C, 1 minute at 56°C, 
and 2 minutes at 72 °C). Following this, nested amplification was performed on 5 yul of a 
1/100 dilution of the amplification product, using SEQ ID NO: 19 and the abridged universal 
amplification primer: 

5' - GGCCACGCGT CGACTAGTAC - 3* (SEQ ID NO: 21). 

Amplification involved 30 cycles (1 cycle being defined as 1 minute at 94°C, 1 minute at 
56°C, and 2 minutes at 72 °C). The resulting PCR product was cloned, following standard 
procedures, and sequenced. 
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These three protocols, i.e., the two experiments described supra which generated 
fragments, and the 5* - RACE PCR, also described supra , permitted alignment of the 
sequenced amplification product, to generate the complete sequence. 

Following the alignment, oligonucleotides were generated which flanked the deduced 
open reading frame, i.e.: 

5' - CCTTCCCCAG TCACCAGTTG - 3' (SEQ ID NO: 22) 
and 

5' - TAATTGTTAT TCTTAGCAGG - 3' (SEQ ID NO: 23). 

These primers were used to amplify the entire open reading frame, using mRNA from CD3 
specific mAb stimulated cells, as described supra . For amplification, 25 cycles (1 cycle being 
defined as 1 minute at 94°C, 1 minute at 56°C, and 2 minutes at 72°C). 

The complete sequence of the human cDNA is set forth at SEQ ID NO: 24. 

As with the murine sequence, there are potential start codons at positions of SEQ ID 
NO: 24 which correspond to amino acids 1 and 13, as well as codons corresponding to 
methionine at amino acid positions 58, 85, and 92. The possible initiator codons correspond 
to proteins with calculated molecular weight of 19,998 daltons, and 18,735 daltons 
respectively (for 176 or 167 amino acids, respectively). As with the murine form of the 
protein, hydrophobic leader sequences are seen, indicating an N-terminal signal sequence of 
from about 20 to about 40 amino acids. 

Example 24 

These experiments detail work on the isolation of human genomic DNA corresponding 
to the cDNA discussed supra . 

Based upon the cDNA sequences, primers were developed which correspond to 
nucleotides 5 1-70 and the complement of nucleotides 631-650 of SEQ ID NO: 24. PCR was 
carried out, using standard methodologies. Specifically , lOOng of genomic DNA was used as 
a template, and 33 cycles of amplification were cararied out (one cycle of amplification being 
defined as 94° C for 30 seconds, 50° C for 30 seconds, and 72° C for 5 minutes). Once a 
sequence was isolated, it was sequenced, and this is set forth as SEQ ID NO: 25. The 
sequence is about 4.8 kilobases in length, and is believed to contain the entire genomic 
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sequence encoding the TIF molecule, lacking only the 5' flanking region, the promoter, and 
the 3' end. 

Example 25 

It was of interest to identify where the genomic DNA discussed supra was located in 
the human genome. In order to do this, two different approaches were taken. In the first, the 
sequence discussed supra , i.e. , SEQ ID NO: 25, was labeled with a flourescent label, and then 
was used to probe the human genome via fluorescent, in situ hybridization ("FISH") using 
standard methods. 

In a second approach, a panel of radioactive hybrid clones were screened using the 
probe consisting of nucleotides 51-70 of SEQ ID NO: 24, and 5'-ATCAGATGGA 
TTACTGA ATG-3 ' (SEQ ID NO:26). PCR was carried out using 25 ng of genomic DNA as 
a template, for 35 cycles, where one cycle is defined as 94 °C for in minute, 55° C for i 
minute and 72 °C for 2 minutes. 

Both methodologies indicated that the gene is located at chromosome 12ql5. Some 
work links diseases associated with asthma at this site. See, e.g. Nat. Genet. 15:389-392 
(1997); Ober, et al, Hum. Mol Genet. 7(9): 1393-1398(1998); Nickel, et al, Genomic 
46(1):159-162(1997); Takahashi, etal, Genomics 44(1): 150-2(1997); Barnes, et al, Genomics 
37(1):4 1-50(1 996), all incorporated by reference.. 

Example 26 

These experiments describe the manufacture of antibodies which bind to the TIF 
protein. To make these, a peptide consisting of amino acids 40-61 encoded by SEQ ID NO: 
7 was coupled to KLH carrier protein, using standard methods and a ratio of 1 mg peptide to 
1 mg carrier protein. Subject animals (rabbits), were immunized 3 times, at 2 week intervals, 
with 150 fig of the complex. The immunogen was emulsified in Complete Freund's Adjuvant 
for the first injection, and then Incomplete Freund's Adjuvant for the next two. 

A first bleed was performed one month after the last injection, and serum was 

prepared, following known methods. 

The serum was then tested in a standard Western Blot. In brief, 10 pi of supernatant 
from cells transfected with either SEQ ID NO: 7 or SEQ ID NO:24 were separated via SDS- 
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PAGE electrophoresis, and then blotted onto PVDF membranes. Antiserum was diluted to 
1 :500, and used in a standard Western Blot protocol, together with anti-rabbit antibody as the 
secondary antibody, and a commercially available detection kit. 

It was found that the serum did, in fact, recognize the TIF protein. 

5 In figure 1, the deduced amino acid sequences of murine and human TIF are set out. 

The high degree of homology is seen in the boxed regions. 

The foregoing examples describe the invention, one aspect of which are isolated 
nucleic acid molecules, which encode TIF proteins such as those with the amino acid sequence 
of the protein encoded by the nucleotide sequence of SEQ ID NO: 7, 24 or 25. It will be 

10 appreciated by one of ordinary skill that the degeneracy of the genetic code facilitates the 

preparation of nucleic acid molecules which may not be identical to the nucleotide sequence 
of SEQ ID NO: 7, 24 or 25, but which encode the same protein. Of course, SEQ ID NOS: 
7, 24 and 25 are preferred embodiments of this invention, but other embodiments are also a 
part of the invention. Genomic DNA, complementary DNA, and RNA, such as messenger 

15 RNA, are all to be included therein. Isolated nucleic acid molecules from other animal 
species, including other mammals, are also a part of the invention. A preferred aspect of the 
invention are isolated nucleic acid molecules whose complements hybridize to SEQ ID NO: 
7, SEQ ID NO: 8, SEQ ID NO: 9, or SEQ ID NO: 24 under stringent conditions. "Stringent 
conditions, " as used herein, refer, for example, to hybridization at 65°C in buffer (3.5xSSC), 

20 0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine serum albumin, 25mM NaH 2 P0 4 

(pH 7), 0. 1 % SDS, 2mM EDTA, followed by a final wash at 2xSSC, room temperature and 
then 0.1xSSC/0.2xSDS at temperatures as high as, e.g., about 65°C. More stringent 
conditions, such as 0. lxSSC, can also be used. These nucleic acid molecules encode proteins 
of about 17-22 kD as determined by SDS - PAGE, which activates STAT proteins, such as 

25 STAT 1,STAT3 and/or STAT5. In glycosylated form, these proteins can range from about 

17 to about 30 kilodaltons, as determined by SDS - PAGE. 

Also a part of the invention are expression vectors which include the nucleic acid 
molecules of the invention, operably linked to a promoter, so as to facilitate expression of the 
DNA. It is well within the skill of the artisan to prepare such vectors. 

30 The vectors, as well as the nucleic acid molecules per se, can be used to prepare 

recombinant cells, be these eukaryotic or prokaryotic, wherein either an expression vector or 
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the nucleic acid molecule itself is incorporated therein. E. coli cells, COS cells, CHO cells, 
etc. , are all examples of types of cells which may be used in accordance with this aspect of the 
invention. 

Proteins encoded by the above referenced nucleic acid molecules, preferably in isolated 
5 form, are another feature of this invention. By "protein" is meant both the immediate product 
of expression of the nucleic acid molecules, glycosylated forms of it, as well as multimeric 
forms, such as dimers, trimers, and so forth. Also a part of the invention are multimers, such 
as dimers, which contain at least one protein molecule of the invention, and at least one, 
different protein molecule. Preferably, this different protein molecule is a cytokine, such as 

10 IL-10. Also included as a feature of the inventions are constructs, such as ftision proteins, 

where all or a part of the proteins described supra are linked in some fashion, such as in a 
fusion protein, to at least one addtional protein or peptide, or amino acid sequence. The 
"fusion partner" may be, for example, a molecule which provides a recognizable signal, either 
directly or indirectly, such as a FLAG peptide, p-galactosidase, luciferase, and so forth. 

15 These fusion partners are preferably joined to the molecule which is described supra at the N- 

and/or C- terminus of the protein; however, it is to be understood that there are many 
techniques known for joining molecules to amino acids, and any and all of these methodologies 
can produce constructs which are a part of the invention. 

The individual protein molecules of the invention, as noted supra , will preferably have 

20 a molecular weight of from about 17 to about 30 kilodaltons, as determined by SDS-PAGE. 

In multimeric forms, the molecular weight of the complex will, of course, vary, but the TIF 
molecules contained therein will each have a molecular weight of about 17 to 30 kilodaltons, 
as determined by SDS-PAGE. 

The proteins preferably consist of at least about 120 and no more than about 200 amino 

25 acids. Preferably, the amino acids sequences consists of or comprises all or part of the amino 

acid sequences encoded by SEQ ID NOS: 7,8, 9, 24 or 25. More preferably, the amino acid 
sequence contains all but about the first 40 amino acids encoded by said SEQ ID's. Even 
more preferably, it contains all but about the first 20 amino acids encoded by these sequences. 
Most preferably, the protein comprises amino acids set forth at SEQ ID NO: 27 or 28. 

30 It will be appreciated by the skilled artisan that the proteins encoded by the above 

recited nucleic acid molecules are a feature of the invention, and may be used to produce 
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antibodies, in accordance with standard protocols. Such antibodies, in monoclonal and 
polyclonal form, constitute a further feature of the invention as do fragments of said 
antibodies, chimeric forms, humanized forms, recombinant forms, and so forth. Also a 
feature of the invention are immunogens, comprising all or a part of the amino acid sequence 
5 protein molecules of the invention, preferably combined with an adjuvant, such as Complete 
or Incomplete Freund's Adjuvant. Portions of the protein sequences may be linked to other 
molecules, such as keyhole limpet hemocyanin, to render them more immunogenic. These 
antibodies can be used, e.g., to determine if the proteins of the invention are present. This 
is a further feature of the invention, as is now explained. It has been shown, in the examples, 

10 that the nucleic acid molecules of the invention were expressed in the presence of the IL-9. 

Hence, a further feature of the invention is a method to determine if IL-9 is or has been 
present, wherein one detects either the proteins of the invention, using antibodies for example, 
or mRNA using the nucleic acid molecules of the invention, as probes. The mRNA can be 
determined directly, or in the form of cDNA. Such probes may or may not be labeled, as a 

15 matter of choice for the user. Hence, one can determine, for example, if, following 

administration of IL-9, the cytokine is still efficacious, by determining if the nucleic acid 
molecule of the invention is present. This type of assay can be adapted, for quantitative 
studies, wherein one determines, for example, either if a cell is sensitive to IL-9, and if so, 
how sensitive it is. One can also use the proteins of the invention to phosphorylate STAT 

20 proteins such as STAT1 , STAT3 and/or STAT 5. This in turn results in dimerization of the 
STAT protein, followed by migration to the nucleus to provoke the effect that these STAT 

proteins have on cells. 

One could also use these molecules to test the efficacy of IL-9 agonists or antagonists 

when administered to a subject, such as a subject suffering from lymphoma , an immune 
25 system disorder such as an allergy, acquired immune deficiency syndrome, autoimmune 

diabetes, thyroiditis, or any of the other conditions described in , e.g, US Patent No. 

5,830,454; 5,824,551, and pending application Serial No. 08/925,348, filed on September 8, 

1997 now allowed, all of which are incorporated by reference. The molecules can also be 

used to mediate the role of IL-9 in these and other conditions. To elaborate, since IL-9 
30 induces TIFs, the TIFs are useful as IL-9 activity mediators. Thus, a further aspect of the 

invention is a method to determine activity of endogenous IL-9, such as in situations where 
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excess IL-9 activity is implicated, such as asthmas, allergies, and lymphomas. One can also 
block or inhibit IL-9 activity by blocking or inhibiting TIF or TIF activity, using, e.g., 
antisense molecules, antibodies which bind to TIF, or other antagonists of these molecules. 
For example, m uteins of TIF, which bind to the TIF receptor but do not activate it, therby 
5 inhibiting IL-9 induced activity, are a feature of the invention. Examples of conditions which 

can be treated by the use of such TIF muteins are allergies, asthma, and so forth. Muteins in 
accordance with the invention can be made in accordance with, e.g., Weigel, et al, Eur. J. 
Biochem 180(2):295-300(1989) and Epps, et al, Cytokine 9(3): 149-156(1997), both of which 
are incorporated by reference. Such muteins can be used in the treatment of asthma, allergies, 

10 or both. Further, it will be clear to the skilled artisan that the models set forth, supra , can also 

be used to screen for appropriate muteins/ The ability to regulate IL-9 activity is important 
in conditions such as those listed supra, as well as conditions such as apoptosis, including 
Cortisol induced apoptosis, conditions involving the nuclear expression of BCL-3, since IL-9 
is known to induce such expression, and so forth. "Antibodies," as used herein, refers to any 

15 portion of an antibody which binds to TIF, including chimeric and humanized antibodies. 

Another feature of the invention relates to the ability of the TIF type molecules of the 
invention to either promote regeneration or inhibit differentiation of tissue types on which the 
molecules are active. As was shown, supra , the TIF molecules target various cancer and 
normal cell lines (i.e. , mesangial and neuronal cells, as well as melanoma and hepatoma cells). 

20 Hence, one can stimulate regeneration of tissue via, e.g., adding an amount of a TIF type 
molecule to a sample in need of regeneration of a tissue acted on by the TIF molecule. This 
approach can be used both in vitro, and in vivo. Similarly, antagonists of TIF may be added 
when the situation is one where the aim is to inhibit differentiation of a particular type of 
tissue, such as melanoma or hepatoma. 

25 The genes which encode TIF, as noted in Example 25, supra, are located on 

chromosome 12. This chromosome is associated with asthma, as is known in the art. Hence, 
a further embodiment of the invention is a method for determining susceptibility to conditions 
such as, or related to asthma, by determining if aberrations, such as polymorphisms, deletions, 
additions, etc., are present at the site of the TIF gene. Such aberrations may be an indicia of 

30 susceptibility to, or of the presence of, asthma, an allergic condition, or one or more related 
conditions. The ability to detect aberrations in a DNA sequence is well known in the art, and 
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such methods need not be set forth herein. Preferably, the aberration or aberrations is detected 
via standard techniques, such as PCR, using the methodologies and primers referred to supra . 
Other features of the invention will be clear to the artisan and need not be discussed 

further. 

The terms and expressions which have been employed are used as terms of description 
and not of limitation, and there is no intention in the use of such terms and expressions of 
excluding any equivalents of the features shown and described or portions thereof, it being 
recognized that various modifications are possible within the scope of the invention. 



WO 00/24758 



PCTAJS99/24424 



Claims 

1 . An isolated nucleic acid molecule which encodes a T cell derived inducible factor, the 
complementary sequence of which hybridizes, under stringent conditions, to at least 
one of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID 
NO: 24 or SEQ ID NO:25. 

2. The isolated nucleic acid molecule of claim 1, wherein said isolated nucleic acid 
molecule encodes a protein having the amino acid sequence of the protein encoded by 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID NO: 24 or 
SEQ ID NO: 25. 

3. The isolated nucleic acid molecule of claim 1, wherein said molecule is cDNA. 

4 . The isolated nucleic acid molecule of claim 1 , wherein said molecule is genomic DNA . 

5. The isolated nucleic acid molecule of claim 2, the nucleotide sequence of which 
consists of the nucleotide sequence SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, 
SEQ ID NO: 29, SEQ ID NO: 24 or SEQ ID NO:25. 

6. The isolated nucleic acid molecule of claim 4, having the nucleotide sequence of SEQ 
ID NO: 25. 

7. An isolated nucleic acid molecule which encodes the protein encoded by the isolated 
nucleic acid molecule of claim 1 . 

8. Expression vector comprising the isolated nucleic acid molecule of claim 1, operably 
linked to a promoter. 
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9. Expression vector comprising the isolated nucleic acid molecule of claim 2, operably 
linked to a promoter. 

10. Expression vector comprising the isolated nucleic acid molecule of claim 3, operably 
linked to a promoter. 

1 1 . Expression vector comprising the isolated nucleic acid molecule of claim 4, operably 
linked to a promoter. 

12. Expression vector comprising the isolated nucleic acid molecule of claim 5, operably 
linked to a promoter. 

13 Expression vector comprising the isolated nucleic acid molecule of claim 6, operably 
linked to a promoter. 

14. Recombinant cell comprising the isolated nucleic acid molecule of claim 1 . 

15. Recombinant cell comprising the isolated nucleic acid molecule of claim 2. 

16. Recombinant cell comprising the expression vector of claim 8. 

17. Recombinant cell comprising the expression vector of claim 9. 

18. Recombinant cell comprising the expression vector of claim 10. 

19. Recombinant cell comprising the expression vector of claim 1 1 . 

20. Isolated protein encoded by the isolated nucleic acid molecule of claim 1 , and having 
a molecular weight of about 17-30 kilodaltons as determined by SDS - PAGE. 
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21. The isolated protein of claim 20, comprising at least 120 amino acids of the protein 
encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID 
NO: 24, or SEQ ID NO: 25. 

22. The isolated protein of claim 21, comprising at least all but the 40 N terminal amino 
acids encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, 
SEQ ID NO: 24, or SEQ ID NO: 25. 

23 . The isolated protein of claim 22, comprising at least all but the 20 N terminal amino 
acids encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, 
SEQ ID NO: 24, or SEQ ID NO: 25. 

24. Antibody which specifically binds to the isolated protein of claim 20. 

25. The antibody of claim 24, wherein said antibody is a monoclonal antibody. 

26. A method for determining effectiveness of interleukin-9 on a cell, comprising 
contacting said cell with an agent specific for at least one of (i) an isolated nucleic acid 
molecule which encodes a protein whose amino acid sequence is identical to the amino 
acid sequence encoded by the nucleotide sequence of SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO: 9, SEQ ID NO:29, SEQ ID NO: 24 or SEQ ID NO: 25 and (ii) a protein 
whose amino acid sequence is identical to the amino acid sequence encoded by the 
nucleotide sequence of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO:29,SEQ ID NO: 24 or SEQ ID NO: 25 and determining interaction of said agent 
with (i) or (ii) as a determination of effectiveness of interleukin-9 on said cell. 

27. The method of claim 26, wherein said agent is an antibody which specifically binds to 
(ii). 
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28. The method of claim 26, wherein said agent comprises the isolated nucleic acid 
molecule of SEQ ID NO: 7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 29, SEQ ID 
NO:24, or SEQ ID NO: 25. 

29. A method for stimulating activation of a STAT protein comprising administering an 
amount of the protein of claim 20 sufficient to stimulate activation of said STAT 
protein. 

30. The method of claim 29, wherein said STAT protein is STAT1 , STAT3 or STAT 5. 

31. A method of inhibiting activation of a STAT protein, comprising administering an 
amount of an antagonist of the protein of claim 20 sufficient to inhibit stimulation of 
expression of said STAT protein by said protein. 

32. The method of claim 3 1 , wherein said STAT protein is STAT1 , STAT3 or STAT 5. 

33. A method for determining presence of TIF in a sample, comprising contacting said 
sample with an agent which binds to TIF or a nucleic acid molecule encoding TIF, and 
determining said binding as a determination of TIF in said sample. 

34. The method of claim 33, wherein said agent is an antibody. 

35. The method of claim 33, wherein said agent is a nucleic acid molecule. 

36 . A method for screening to determine if a substance influences IL-9 activity , comprising 
adding said substance to a sample of TIF producing cells, in the presence of IL-9, and 
determining production of TIF, wherein a difference in production of TIF by said cells 
as compared to production of TIF by said cells in presence of IL-9 but not said 
substance indicates said substance influences IL-9 activity. 
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37 The method of claim 36, wherein said substance is an IL-9 inhibitor or antagonist, said 
method further comprising determining lower levels of TIF production by said cells in 
the presence of said substance as compared to its absence. 

38. The method of claim 36, wherein said substance is an IL-9 activator, said method 
further comprising determining higher levels of TIF production by said cells in the 
presence of said substance as compared to its absence. 

39. A method for determining an aberrant level of IL-9 activity in a subject, comprising 
determining level of TIF in a subject and comparing said level to a normal level, 
differences therebetween being indicative of an aberrant level of IL-9 in said subject. 

40. The method of claim 39, wherein said aberrant level is excess endogenous IL-9. 

41 The method of claim 39, wherein said aberrant level is insufficient endogenous IL-9. 

42. The method of claim 40, wherein said subject suffers from asthma, an allergy, or 
lymphoma. 

43. A method for inhibiting IL-9 induced activity in a subject in need thereof, comprising 
administering an amount of a TIF inhibitor sufficient to inhibit IL-9 induced activity. 

44. The method of claim 43, wherein said TIF inhibitor is an antisense molecule. 

45. The method of claim 43, wherein said inhibitor is an antibody. 

46. A method for treating a subject suffering from asthma or an allergy, comprising 
administering to said subject an amount of a TIF mutein sufficient to alleviate said 
asthma or allergy. 

47. A method for determining if a mutein of TIF is therapeutically useful, comprising 
contacting a cell which produces IL-9 with said mutein, and determining effect of said 
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mutein on prouction of IL-9, reduction thereof being indicative of possible thereapeutic 
efficacy for said mutein. 

48. A method for determining susceptibility to a condition characterized by aberrant 
expression of TIF, comprising determining nucleotide sequence of a TIF gene of a 
subject believed to possess an aberrant TIF gene, presence of an aberrant TIF gene 
being indicative of possible susceptibility to asthma or allergy. 

49. The method of claim 48, comprising contacting a sample taken from said subject with 
a pair of oligonucleotide primers which amplify said TIF gene. 
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TIFalpha cDNA 



taaacaggctctcctctcacttatcaactgttgacacttgtgcgatctctgATGGCTGTCCTGCAGAAATCTATGAGTTT 

TTCCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGGGCCCAGGAGGCAAA 
TGCGCTGCCCGTCA 

ACACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCCGTACATCGTCAACCGCACCTTTATGCT 
GGCCAAGGAGGCC 

AGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTTCCGAGGAGTCAGTGCT 
AAAGATCAGTGCTA 

CCTGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACGTTCTGCTCCCCCAGTCAGACAGGTTCCA 
GCCCTACATG CAGG 

AGGTGGTACCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTCACATCAGCGGTGACGACCA 
GAACATCCAGAAG 

AATGTCAGAAGGCTGAAGGAGACAGTGAAAAAGCTTGGAGAGAGTGGAGAGATCAAGGCGATTGG 
GGAACTGGACCTGCT 

GTTTATGTCTCTGAGAAATGCTTGCGTCTGAgcgagaagaagctagaaaacgaagaactgctccttcctgccttctaaaa 

agaacaataagatccctgaatggacttttttactaaaggaaagtgagaagctaacgtccatcatcatlagaagatttcac 

atgaaacctggctcagttgaaaaagaaaataglgtcaagttgtccatgagaccagaggtagacttgataaccacaaagat 

tcattgacaatattttattgtcactgatgatacaacagaaaaataatgtactttaaaaaattgtttgaaaggaggttacc 

tctcaUcctUagaaaaaaagcUatgtaacttcatttccatatccaatattttatatatgtaagtttatttattataa 

gtatacattttatUatgtcagtttattaatatggatttatttatagaaacatlatctgctattgatatttagtataagg 

caaataatatttatgacaataactatggaaacaagatatcttaggctttaataaacacatggatatcataaaaaaaaaa 



MAVLQKSMSFSLMGTLAASCLLL1ALWAQEANALPVNTRCKLEVSNFQQPYIVNRTFMLAKEASLADNNT 
DVRUGEKLF 

RGVSAKDQCYLMKQVLNFTLEDVLLPQSDRFQPYMQEWPFLTKLSNQLSSCHISGDDQNIQKNVRRLK 

ETVKKLGESGE 

IKAIGELDLLFMSLRNACV 



aacaaactctcctctcagttatcaacttttgacactJgtgcgatcggtgATGGCTGTCCTGCAGAAATCTATGAG I M II 

CCC^ATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGGGCCCAGGAGGCAAATG 

CGCTGCCCATCAAC 

ACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCCGTACATCGTCAACCGCACCTTTATGCTG 
GCCAAGGAGGCCAG 

CCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTTCCGAGGAGTCAGTGCTAA 
GGATCAGTGCTACC 

TGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACATTCTGCTCCCCCAGTCAGACAGGTTCCGGC 
CCTACATGCAGGAG 

GTGGTGCCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTCACATCAGTGGTGACGACCAGA 
ACATCCAGAAGAA ^ 

TGTCAGAAGGCTGAAGGAGACAGTGAAAAAGCTTGGAGAGAGCGGAGAGATCAAAGCGATCGGGG 
AACTGGACCTGCTGT 

TTATGTCTCTGAGAAATGCTTGCGTCTGAgcgagaagaagctagaaaacgaagaactgctccttcctgccttctaaaaag 
aacaataagatccctgaatggacttttttactaaaggaaagtgagaa.gctaacgtcx»ccatcattagaagatttcacat 
gaaacctggctcagttgaaagagaaaatagtgtcaagttgtccatgagaccagaggtagacttgataaccacaaagattc 
attgacaatattttattgtcattgataatgcaacagaaaaagtatgtactttaaaaaattgtttgaaaggaggttacctc 




TIFbeta cDNA 
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lb NO'. 8 C CTtJ) 

tcattcctctagaagaaaagcclatgtaacttcatttccataaccaatactttatatatgtaagtttatttaUataagt 

atacattUatttatgtcagtttattaatatggatttattlatagaaaaattatctgatgHgatatttgagtataaagc 

aaataatatttatgataataactatagaaacaagatatctlaggctttaataaacacatgaatatcataaa 



TIFbeta Protein 



tO I)) 



MAVLQKSMSFSLMGTLAASCLLLIALWAQEANALPINTRCKLEVSNFQQPYIVNRTFMLAKEASLADNNTD 

VRLIGEKLF . . 

RGVSAKDQCYLMKQVLNFTLED1LLPQSDRFRPYMQEWPFLTKLSNQLSSCHISGDDQNIQKNVRRLKE 

TVKKLGESGE 
IKAIGELDLLFMSLRNACV 



ic Sequence 



TIFalpha Genomic 

GTCTATCACCTGCTTAAGATTCTTCTAATTTATAAAAAAAACW 
G CAC GTATTTA 

TAGCATGGTGTTCTGACCATGCAGGTACAGAGTGGAATGGTAAGAGGCGCTATTATCAGCATTAACC 
AACATGTTAATGT 

TTTCTTCTG G C AAGC AAACTTG AAATCTATGTCTTAAAC AATCTTC AAG CCTCTAATATAGTGCTAACG 
ACTGGAGTCCG 

CTGCTGTCCAACAGAGCTCTTGAGCACGCTCTCCTCTGTTTGCAATTTTATGTTCTTTGATCGACTCC 
GCAACGTCTCAC 

CTTCG G CTC CTG ATG G CCACCTTTCAACTTTCTG C ATTTATG AACTCCATGTTTTAATCTTTTTATTAA 
AATATTCACAC 

AATCAGTGTTTGTGCAAGTCTGTTTCACCCACATGTATGTCTGTGCACCAAGTGCTGCCTGGTGCTT 
GTGGGGGCAAGGA 

GCAGGAGAGGGTGCCCTGGCACCGGAGTCACGGATGGTTGTGAGCCACCATGAGGATGCTGGGAG 
TTAGACCCAGGTCCT 

CCAGAAGTGCAGCAAATGCTCTTAACCACACGCAGGCATTTCTCTCTCCAGCCCCAACATGAGTGCT 
TTTAG ATTC C ACC 

TAGAATAGAGATCTGATGGCTTCACTCACTGCCACCTCCCCTTTGCATCTTTCTGCCAAGGAACACC 
AAAAAGCAAGAAT 

CCCCACACTGCTTTCGCTCCTCAAGTCTGCACCTCTCAACAGGTCAAGATTCTCCAGTGTCCCTCTA 

ACACTTTCCCCAG 

TGTCCCTCTAACACTTTCTCCAGTGTGCCTCTAACACTTTCTCCAGTGTCCCTCTAACACTTTTGATCT 

CAATTAGCTGA 

G G G G AG AAAG ATCTCAC AC AGTG ATTTTC ATGACTTCGCGTTCTAGTCTAG ATGTAG GCATTTG CGT 
GTCAGTCTAGGGT 

AGGCGTCTGCTCCCGCTGCTTAGGAAAGACTTTCCTAGTCTAGTTGTCAGGTGCTATCTGGGATTCA 
GTGTACATACAAT 

GCAAAAAATCCCAGTATTTTGTAAATTCTCTTCTTCAACT 

TCATTTAAAAA 

TAATATTTTG AG ACTTATGCTTG C^Jy AAGTAAAATGTC AGAG AATTAG CAAATGTATAGTATTATTTTA 

\\\\ AAAAAAA 

TCTATG CTTAAAATGTCTATTAG ATTGTTCACTACCG ATATTTCCAAACTTAACTTG AC CTTGGCTATG 
ATTTCAACCTT 

TGTATTTGCATCTACCATAACAGTCTCTGAACCAGAACATTCTGTGGCAATGGGAGCTGTGAAGAAA 

GCCAACATTCTTA 

TTAAAAAAAAAAAACAGCTAGTTATAGT^^ 

TAAAAATTGA 
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AATAATCTC C AAGTTTTC ATTATG GCTTATTTCAAAGCACAG AATATAG G AC ACQ G GTCTTTTATTTr T 
GGTCACTTCTA ,UI 

AAGAGATAAGAATCTATGAAGTTGGTGGGAAAATGAGTCCGTGACCAAAACGCTGACTCAATAGCTA 
CGGGAGATCAAAG 

G CTG CTCTACTC AATC AG AATCTACTACG GC AAAGCCATG G CTTTCTTTG AAAACCGTG TTTAG AAfi A 
TTTCTGGGATTT 

GTGTGCAAAAGCACCTTGTTGGCCCTCACCGTGACGTTTTAGGGAAGACTTCCCATCTCTCAAGGTG 
GGAAGGCTTGGAG 

GTGGTGTCTTGTGGCCTCCTATGGTGGTTAGGTACTTCTCAGAAGACAGGACTGGAAATTAGATAAT 
GTCTG ATGTCATA 

TCATTCACAATACCAAAAAAACCCTGGTGTCCCGATGGCTATAAAAGCAGCAACTTCTGCCTCTCcr 
ATC AC AAG C AG AG 

ACACCTAAACAGGTAAGCACTCAGACCTCTACAGACAATCATCTGCTTGGTACCATGCTACCCGACG 
AAC ATG CTCCCCT " " " * 

GATGTTTTTGCCTTTTGCTCTCTCACTAACAGGCTCTCCTCTCACTTATCAACTGTTGACACTTGTGC 
GATCTCTGATGG 

CTGTCCTGCAGAAATCTATGAGTTTTTCCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATT 
GCCCTGTGGGCC 

CAGGAGGCAAATGCGCTGCCCGTCAACACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCC 
G TAG ATC G TC AAC CG 

CACCTTTATGCTGGCCAAGGAGGTACAGCTGCATCTCTTTCTCTCCATACCGCCTTGCCATTTTCTCT 
GAAGCACTTGCA ^ 

AACTCTTTAGGGGCGCTTTATCTCCGCAGGTCTCACTACCTATGTTTTCTGTCTCTTTAGAGACTCTT 
TAAGGACTGGGT 

0 1 I I I 1 CTATTTCTATTTCAAGGTCTCAGGACCATTTCCTATCTTGGCCTTCAGGACACATATACTGAA 
TTTTATCTACA 

GAGGCGCATTTAGAAAGCCACCCACGACTGCAATACTTTCCATTTCTCTGTGCTCTCTTCTGAACTCA 
TACTCTCTTGGC 

TACfcCTGAGACCCACTGCGGACATACATCTCTACTTACAGGCTTTTCTTCCATCTCCTTGTCACCCA 
GGCACTTAGGGT 

TTTCTCTCTTTCAGGCCAGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTT 
CCG AG G AGTC AGT 

GTAAGTCCTCACTGTGATGAGCAGGGCTAGCTGCGGGAGCTGGTGGACCCTCTGGGATAGTCTGAC 
GTATGACCCCTGCT 

GCTTCTTGTCTACCTGCAGGCTAAAGATCAGTGCTACCTGATGAAGCAGGTGCTCAACTTCACCCTG 
GAAGACGTTCTGC 

TCCCCCAGTCAGACAGGTTCCAGCCCTACATGCAGGAGGTGGTACCTTTCCTGACCAAACTCAGCA 
ATCAGCTCAGCTCC 

TGTGTAAGTCTGACTCTGGCTACCTATGCTCCTCTCTCTTCCTCTTCTATTCCAGTAAGAACCCGAGG 
TCCTGCCCTCTC 

TCTCTTCACAAGAGTGAGGAGGGCCTCAGCACCACCACCATCATAGGCCACTTGAAATAGGTCACAA 
AGGCTTTGGCTTC 

AATTGAGTAATACTTTGAGTTTGTATGAGTGAAGCT7TATTTGTTTTATCCATGGA 
AAATTCTGTAG 

GATGAGAAAGATGTTGGGAACGAAAAAAGGCCTAGATAGAGAAACAGATCTGCTGAGTATAGTACTT 
ATGGGGGGAGCAG 

GGGGCGATATCCACTGAGTACAAGTACTTGTGGGGAGAGAAATCCACTGAGTACAAGTACTTGTTG 
G CATG G AG ATCCAC 

TGAGTACAAGTACTTGTGGGGGGAGGGAATGGCACAGAGCAAAAGTTGAAGGGAAGGAAGATGGA 
GAG GCCTC ATGGTTG 

G G GGTGTG AAAG GTCACTCC TTTTCC ATGTG ATG G AG AGTTAAG AAAAACCAGTGTGTG AGTTTG AT 
GTCTTCAGACACC 

CCC AACTATGAAAC ATATCC ACG AG G AG CG GGC AGACTGTG G G AG ACCTG G CATTTAGGG AAGG C 
G CG G CTTTTC AC ACG 
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AGAAACTTTATGCTCATCTCTTGTGCTACACTCCCACCTTTGATGAGGTTCAGCTCAGGTTTCGTTTC 
TACCGTTCTTGC 

TACTGGTGGAAACTTCAGTAGGATTCCCCAAAGACGAGGACAGCTCTTCTGTAAGGGAGGGACCTG 
GATTTCAGTGTCCT 

AGAG AACGAAATAG CTCAG AG AATCTAGGTCAACGTG AAATCTAGGTC ACAG CGG G C AAAAATG ACT 
G AACG CC TCTATT 

CCAGGTGAACGGTCACGTGCCTCAGATATACTGAGGTATTGGGCTCCCACCGGATAAGATTCTGTTA 
GTGAGTCTGCTTT 

TATTTTGCAGCACATCAGCGGTGACGACCAGAACATCCAGAAGAATGTCAGAAGGCTGAAGGAGAC 
AGTG AAAAAG GTAC 

TATTGGCAAGCCACAATACTAAGCCATTCAGTAGGAGACGTGGGGATTTCTTTCTCTGCTTCCCAGT 
C CCT TCTAC TTTG 

TAACATTTTATTTG ACTTGTCTACTATCTGGTCCATTACTCGCTTAGCTG CACCTGTATCTAGCTG GGT 
CTATAGATCTT 

TCAATCTGTGTCTAAATTTGTAAGTCACAATTCTGGAGCTAGCAGAAAGCTTAGCTCAGCCAGTCTCA 
TGAGCACTTGCT 

CGGAGGATGGCTTGTGACAGAGTCAATGCTAGAAGACAGCATCCCTGATTCCCAGCTCTGCACTTG 
C CTAG TG G CCATGT 

GTAATTACTTTGGCTTGATTAAGTATTTGGGAAAGCCAGTTCCCACGGACCTACATAATCTGAAGAAC 
CATGCATTGAAA 

ACTAGAAAGCTGGGCACAAACTTACTAGAGATGATTTTTGAGCTCATTAAACGGATGCTCTGAAATGT 
GGCAAAATCAAC 

CCAGAATAACAACAAAAGAGCTGGATTTGCAAATAGGACAAGTATTTAGAATCACTGGTATTAATAGC 
TATCATCTTAAT 

T AAAAT ATAGG G CCTATATATATATTTAAG ATTAAAC ACAAGAGTGG ATAG C CTC CC AATTTACTTG G 
CCTGGTTTCAAA 

AGAGTAAAAATATCAGTCATGGATTAATTATAGTGTCATGAAAGTATGAGATGGAAACCCTTTCCTTA 
CTTTTTAC CTTC 

ATTTCTTAG I I I I 11 I I 1 I CTTCACACCCTGATCAAGCCACTAGTAAGCACCTATCTGCTGTGAGCTAT • 
TATATGACTTT 

ACAGCAAACAACATTGCTGTGTGGCCTCTTTGGGGAAGGGAACAGGATAGCAGGAGGCTCAGGCTA 
GCAAGTCTGACTTG 

CCCTAAAGCCAGAGGCATGGTTGATAGCAGAGAAAGTGAGGCTCTTCGCAAGTGGGTGTGCTTAAG 
TAATC AG AAAC AG G 

AAG GCTCC GGTTGATGGAATTATCAGTAAGATATCTACCCTTATCTCCTTCTATCGAACCTAAATCGT 
CTC I I I i I CTTG 

TGTGTAGGCTGATAAACACACTTGTTTTCTTTTGAGTGTTCATGGCTTTGTAG A1 I 1 I 1 AGTGCTCTGC 
CAGTTCTTGTT 

AGAGGGTTTGTTACCTTGACACCTGGGCTTGGATGTTAGCATGCCAAAGGCACACACTTCTGAATGC 
CTGTGTAAAAGGT 

TATTATTCATTTACTTTGTCTTTGGAAAGGTGAAGCGTGTGTGAGAAAGAACTCACAGGAGATGTGTT 
CTCTGTAGGAAA 

AC I I I I I n I I CCCCTTAAATG CCTATAATCCACTTTC AGTCAACTTTG ACTTTTATACCATG CTGTC AC 
ATGAAAGAGT 

GTTTAG G CCCGCTCTCATG GCTCTGGG AAAAG C ACC AATAGG G G AAGG AATGTTATG CTG AGAAAT 
CTGACCGGCAGGGA 

AACTGGTCAGAGCTCCCCCGAAGACCACCACAGGTGTTAAGTAGGAACAGTCCAGGGTGGGCTCAT 
GTAATAGAATGGAA 

CAGAGCGAGGGAAGATAAGCTACAAAGTTTCATAGGGTCCGGAGTCTTAAAGATACAAAATAGCTGC 
TTGGGCTTCATAA 

CAAAGGAAGTCTGGGAAGGCAGCAAGTGAGAGGGAAATGGAAAGGGAAAAAACAGAATGTAGAGG 
ACTTGAACAGCTACA 

AATCCTCTACCAGACGATTTTTCTTGGAACAATCTAGAAGGTAGTGGATTAGGTGATTGCAGGGGGA 
CTTGCTTTG CC AT 
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TTGAATCTGGG I Mil GTCTCTCCATTGAGGTTGAAAGCGTCACCCTTTTTACCCTCGAATGGAGGAG 
GAAAGAAGGGGT 

GTTATGACTCCTACCTGGAGTTTTACTAGTTTACGCAATGGAACAGACACTCGGGACCTCCTCTTGA 
C AAAAAAAATG G A 

AACCTGTTGTTTGTCTTGTTTGTTCTTTTGTTAAGAAAGCACAGGCAAAGCCCGACCACATGGGT^ 
ATGTGGGTCTTT 

GAGTCAAGGCTTTTGAGTTGAGCACTCATCAATAGTTGATCATGGTCAGGTGGAGGGCTACCTGTCA 
GGCCGAGCCCTGC 

TGGCTTCGCACTTAACATCTCCAGGTCTCAGTATCACTTCCTGCTACTTAGCACAGTTAGGAGTTGA 
GCAAACC I I I I I I 

TCCA^CCCCACTAAAATTTAATTGACAAAAGACTG 
CTATGTGTGCAT 

TGTGC AAGGTTC AATAAG ATAG ATTAATAG GCCC ATC AACAG CTTT ATG GGTGTG AAATG C AAGTAAT 
ATAGGTAGATGC 

CTG TG G TGTCCTTAG GTC AG AAAG G C ATG ATTTTAAG GTCTTGGG C AAATCATATTATACTC ATG CTA 
AAAATACATTAT 

GTTG ATT ATTAATCTTTT AG AG AAG G CTG ATACTTG GTTTTG GTG CTC AG CAAG C AAATGTC ACC AGC 
TCTTTCTAACTG 

GTACC ACTTTAGAAAATG CTACCTGTGCTCAAATTGGTTTGTATTCTTATTTrTCATAGCTTG GAG AG A 
GTGGAGAGATCA 

AGGCGATTG G G GAACTG GACCTG CTGTTTATGTCTCTG AG AAATG CTTG CGTCTG AG CG AG AAG AA 
GCTAGAAAACGAAG 

AACTGCTCCTTCCTGCCTTCTAAAAAG AACAATAAG ATCCCTGAATG G ACTTTTTTACTAAAG GAAAG 
TG AG AAG CTAAC 

GTCCATCATCATTAGAAGATTTCACATGAAACCTGGCTCAGTTGAAAAAGAAAATAGTGTCAAGTTGT 
CCATGAGACCAG 

AGGTAGACTTGATAACCACAAAGATTCATTGACAATATTTTATTGTCACTGATGATACAACAGAAAAAT 
AATGTACTTTA 

AAAAATTGTTTG AAAGGAG GTTACCTCTCATTCCTTTAG AAAAAAAG CTTATGTAACTTCATTTCCATA 
TCCAATATTTT 

ATATATG TAAGTTTATTTATT ATAAGTATAC ATTTTATTTATGTC AGTTTATTAATATG G ATTTATTTATA 
GAAACATTA 

TCTGCT ATTG ATATTTAGTATAAG G C AAATAATATTTATG AC AATAACTATG G AAAC AAG ATATCTTAG 
G C TTT AAT AAA 

C AC ATG G ATATC ATAAATCTTCTGTCTTGTAATTTTTCTCCCTTTAATATC AACAATACCATC ATC ATC A 
TCATTACCCA 

ATC ATTCTC ATG ATTTC ATG CTTG ACC CATATTATACTGTT AAAG TTGGTTCCTGGAGGCCTGTGGTT 
TTGTGTGTGTTG 

TGTGTGTGTGTGGG GTTATGCATGTG AAAGCCAG AGATGG ATATTAG GTGTTCTTCTCTATCAGTCTT 
TGCCTTATTATT 

TGAGACAGGGTCTGTCACTGAACCTGTAGCTAGGCTGGCCAACAAGCTCTATTA A ' I 11 I I II I AAGAT 

TAATTAATTATG 

TGTAT 



exon 1a: nucleotides 1908-1932 
exon 1b: nucleotides 2033-2262 
exon 2: 2655-2720 
exon 3: 2820-2963 
exon 4: 3931-3996 
exon 5: 6539-7137 
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Human TIF cDNA 



TGCACAAGCAGAATCTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTG 

TCTGCAATGGCCGCCCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTG 

CCTCCTTCTCTTGGCCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGC 
TTGACAAGT CCAACTTCC AG C AG CCCT AT ATCACCAACCGC ACCTTC ATG CTGG CTAAGG AGG CT 

AGCTTGGCTGATAACAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTAT 
GAGTGAGCGCTGCTATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTC 
AATCTGATAGGTTCCAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGG 
CTAAGCACATGTCATATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGA 
CACAGTGAAAAAGCTTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTA 
TGTCTCTGAGAAATG CCTGCATTTGACCAGAG CAAAGCTGAAAAATG AATAACTAACCCCCTTTC 
CCTGCTAGAAATAACAATTAGATGCCCCAAAGCGATTTTT 
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Human genomic sequence of TIF 

TGCACAAGC AG AATCT TC AG AAC AGG 7TCTCCTTCCCCAGTCACCAG * t GC7CG AGTT AG AATTGTCTGCAATGGCCGCCCTG CAG A 
AATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACC^^ 

CCATC AGCTCCC ACTGCAG G C7 TG AC AAG7CC AAC7TCCAG CAGCCCT ATA7CACCAACCGC ACCTTCATGCTGGCTAAGG AGg t a t 
acatctcaaccccgcccccccccgtcggatccacttggaazccaaacagrtcwcaaacttctcttcagagcacctctaagagcctta 
ggaacccaccgcttaccccccagggtagataaattttctcrtctcccfigagaccctttgggaatctggctctttttttttcttgaac 
tcccccctcccacctcgaCirctacgatacatatgacgaatttttcccaaagagcggccattcagcaatccacccgatgattttttt 
ctcctttatgcccccgrccc.;cgccccaaacccatgcacacatctgasccccgc5:tctagtccttatgatgctgctctggggagacg 

ggatggggcacatgtccaicracaaacttttttCctatttgctcaaccrccagacccttagtctttCcttctcttccagGCTAGCTT 
GGCTGATAACAACACAGACG7TCG7CTCATTGGGGAG AAACTGTTCCACGGAG TCAGTgt aagc t acag t tg tgacgaacagggccg 

tgcgccgtccacgggcaccwcgggccgtggtgatgatggtttaggzcccatcccccacgaccccccctgtctcccttccacccgcaq 
A7GAGrTGAGCGC7GC7A7CTGX7GAt k GCAGG7GC7GAAC77CACCCTTGA^ 

TATATGCAGGAGGTGGXGCCCTTCCTGGCCAGGCTCAGCAACAGGZTAAGCAGATGTgtaagttcagctcccagcctatgcccacct 
acccctcctcccctccccccacagagacccccccaccccaaccccc-iccccccccccctacccctaagctagcaggaagaagtgtct 
tggcagcagcg:tatcagga=tcacttgggatcatagagracttg=s = tcgccctgactgagccacaccccgagtttatagcggtga 
acggggcctggaacctaagtgracagaagccgcactggcttgtctcccgaaaaaaggcaacccaggttgcgtaagatgagaaaggtg 
tcgggaaaacac^tagc^gicgaaaeggatccactgagtctaagrrgtcgagcggaggggacggcacggagagaaattagaagagaa 
agtgggaaatccgaaggrrtaaagtcggtggtgggticggragacccr^gccccgccgatgrcatgggaagccacaaaatcggaggcg 
tgtgaactcgacgccgc;caa=a;ttgaaaccatgaaaaaaagcc:gagcggagtgggcccagtaaaaggccctaggacctaccgaa 
gagggctcaatccccacargagatgccttatgcacatccctrgcccraagcarccaacctcctggagatacgattgaggttttattc 
cttacagaat ctgcataaar^accccgctctctccacaaargcaaaccicagcaggactccccaaagatgaagagaggtctcttgta 
agggaagtgaccggatcctggcgtccaagggaacccaagagcccaggaaatctaggccaccgttgaaatctaggccactgtgggcaa 
aattactaagagcttcaatt=caggtgaattgtactgracctcca-gg=igwcgaggctcacaaagcctcagcacaacattaagata 
gtcacgcttcrcactc-rcca: agCATATTGAAGGTG ATGACCTGC At ATCCAGAGGAATG7GCAAAAGCTGAAGGACACAGTGAAA 

AAGgcaggaccgacaactc-caatgccaagccaCgcaacaggagagacaaaccrtgttttccCttccctcctttctccccaccactt 
tgtgattttccacccga-rcicccaccaccagcgcgatcactccggtc-ctgtctatgtagacatacccacatatctagatgccagt 
ttccaaatcttgcaaatcgtagaattctagaactggttgggatcctagrttgcctagtcacacaacctcagatcctggggatggcca 
gcggcagagacagggccagaargcaggcctcccgaaccccaagccagcactccccccggcggcgatacagattagetttggcaccat 
taacccttagggaaatci carat ccccattgactcacgtaatccgaagaagtacccgtttaaaaacagaaaaacgcccatgggcaaa 
ttcatttgaagtcatctcrcaagccaceaatgcattgccttgaaacttggaagaaeaaactcagaacaatgagaaaagagctggacc 
tgcacacagggczaacctccgcagcaacaaacactcatctzgaatzatcataatatccaccagatatcgattacagtttaaaagcaa 
gagcagacaaccccgatctcirrcacacaggttcaaatagagtaaaaatatcagcaagagatctattatagttaaacggaagcccga 
actggtaagcctccttccct;cctcccccccatcaagacc!:tcca*rccag£cccctccttcactccctcaacaaatccccagggag 
cattcatccatggcgggctcgrgcacacttccaragtgaatgacacca-catgrggcctacttggtgaaaagaacaacaatggaagg 
cttagactaacaacagtcicccaccccaaaaccggaggaatgaccaggagcagtgaaagtgacgctcttgcaagcaggtacaactaa 
atactcagaaacatgaargctccagtcgatggaatcttcagcaacaagcttaaccttaattccccctttttccctcctgacttttca 
aaaaagcgccccccccceagca:cacctaatgagcgcgacwctctc«*cctttgataatcgaaggccttgtagccttaaattgtgaa 
gcccagtcccctcgttaragaactaccatctagacatggagggcccaa-gtcagcacgccacagacaaggcatgctccacacatctt 
gcttaaaaaattactgat ttcaccttgcttgttgtctttacaaaagtgaagtgrgagagaggagaatctcatggtgatctgegtgat 
tttcaagacccccaacecattccgaaagaatcaatttcatatecgzaatgggtcgccatgtggaagagtgattatgctttfctfcgctg 
gtagcttcagaaagcacaggagggagagcaatgttgttcagagaaagatcaacaggaggagaaactgtcagagctgtctgaaatagg 
gtggetttgggaggcatcaaztccctctcgtcgggggtaaaagcagaacgcaggttggtagtaaaatgcatgacagacagtagggga 
cgataaactttaaaaccectcaeagccccggagcctttgagatagaaaagaatatctttteggccctacgtcaaaagaagtaeiggaa 
aggtgaaagggcggaagaaagcaggaaaaggaagaaccatgtatcatatagaggacaatggtgacaaggcttttcttgaaacaatgc 
aaatatgatagatcagaggaaicccagtagggaatgccctccactcgaacccgggtctcctcctcgattaagttcgggatcctcatc 
tgcatccgacttggagagagaaagaacgaatgtcaggacccacacctggctttccattaactaaagcaagcggaaaagacttatttg 
gtatttttcccacaaaaccgaaaacttttcttttactgrtcgtcaaataggtggaaatagaaaaagcctcaatgtattggtgaatac 
atggcccaaagtcatttcagtagagatgttttaaatcaggagcgcccaaccatccggctcccccggaccaccttgaaagaatcgtct 
eggtacacacataaaatacaagaacaacagccgacgagccaaaaaagtccatgcataaatctcatactgtttcaagaaagtttatga 
atctccgttagggcgcacccaaagctgccctgggccacgcgcggcccgrgggccgcaggttggacaagctccttataagtaatctgt 
catagatagttctggagrcgraaaacaggc^aaggcataargggcggcactcgggatcccccagatcccagccccacttcagtictcc 
tcgctctggtcaagaaggggcggtcaactctccgcccagcttttaaacagcttcattagtgcgaggtgcacctgaaattgatgeetg 
ctggcggcctctcagcccagagagccgtcatttcaagctcrttggcaaatcatacaacactaaagggatattaccatgaatgtttta 
caaatgcttaaaacccggcctctgtctccatcaacctaaccctgcaatccctaatctgttcaccctagaaaacacggcataaatgct 
caaacact CCCgcatCCCCatt CCcacagCTTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTC 
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TGAGWVATCCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAAT^ 
AAAGCGATTTTT 
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TIFbeta Genomic Sequence C 67* I ^ aJO> 1^1 

GAATTCAAGTCCACATGCAATCAATCCGAATACTTTGTAAATTCTCTTCTTCAAATATCCATCTATATA 
GTAT AAGTTATTG TAG G ATCATTTAAAAATAATG TTTTG AG ACTTATGTTTG C AC AAGTAAAATGTCAG 
AG AG AATTAG C AAATGTATAGTATTATTTTATTTTAAAAAATCTATG CTTAAAATGTCTATTAG ATTGTT 
CACTACTGACATTTCCAAACTTAACTTGACCTTGGCTATGATTTCAACCTTTGTATTTGCATCTACCAT 
AACTGTGTGCTCACTTACCATGCTATCCGACGAGCATGTTCCCCTGATGTTTTTGCCTTTTGCTCTCT 
CGCTAAC AG G CTCTCCTCTCAGTTATCAACTTTTG AC ACTTGTGCG ATCGGTG ATG GCTGTCCTGCA 
GAAATCTATGAGTTTTTCCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGG 
GCCCAGGAGGCAAATGCGCTGCCCATCAACACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAG 
CCGTACATCGTCAACCGCACCTTTATGCTGGCCAAGGAGGTACAGCTGCATCTCTTTCTCTCCATAC 

CGCCTTGCCATTTCTCTGAAGCACTTGCAAACTCTTTAGGGGCGCTTTATCTCCGCAGGTCTCACTA 
CCTATG TTTTCTGTCTCTTTAG AG ACTCTTTAAG G ACTG G ATCTTTTTCTATTTCTATTTCAAG GTCTC 

AGGACCATTTCCTATCTTGGCCTTCAGGACACATATACTGAATTTTATCTACAGAGGCGCGTTTAGAA 

AGCCACCCACGACTGCAATACTTTCCATCCTGTTGTGCTCTCTTCTGAACTCATACTCTCTTGGCTAC 

TCCTGAGACC^CTGCGGACATACATCTCTACTTACAGGCTTTTCTTCCATCTCCTTGTCACCCAGGC 

ACTTAGGGTTTTCTCTCTTTCAGGCCAGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGA 
GAA 

ACTGTTCCGAGGAGTCAGTGTAAGTCCTCACTGTGATGAGCAGGGCTAGCTGCGGGAGCTGGTGGA 

CCCTCTGGGATAGTCTGACGTATGACCCCTGCTGCTTCTTGTCTACCTGCAGGCTAAGGATCAGTGC 

TACCTGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACATTCTGCTCCCCCAGTCAGACAGGTTCC 

GGCCCTACATGCAGGAGGTGGTGCCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTGTAA 

GTCTGGCTCTGGCTACCTATGCTCCTCTCTCTTCCTCTTCTATTCCAGTAAGAACCCGAGGTCCTGC ■ 

CCTCTCTCTCTTCACAAGAGTGAGGAGGGCCTCAGCACCACCACCATCATAGGCCACTTGAAATAG 
GTC ACAAAG GCTTTGG CTTCAATTG AGTAATACTTTG AGTTTC 

CCATGGAAAGAAATCAACTCAAATTCTGTAGGATGAGAAAGATGTTGGGAACGAAAAAAGGCCTAGA 

TAGAGAAACAGATCTGCTGAGTACAGTACTTATGGGGGGGGGGGGCAGGGGGCGATATCCACTGA 

GTCCAAGTACTTG7TGGGAGAGAAATCCACTGAGTACAAGTACTTGTGGGGGAAGGAATGGCACAG 

AGCAA AAGTTGAAGGGAAAGAGGAAGATGGAGAGGCCTCAATGTTGGGGGTGTGAAAGGTCACTCC 

1 l I I I CCATGTGATGGAGAGTTAAGAAAAATCAGTGTGTGAGTTTGATGTCTTCAGACACCCCAACTA 

TGGCAGACTGTGGGAGACCTGGCATTTAGGGAAGGCGCGGCTTTTCACACGAGAAACTTTATGCTC 

ATCTCTTGTGCTACACTCCCACCTTTGATGAGGTTAAGCTCAGGTTTCGTTTCTACCGTTCTTGCTAC 

TGGTGGAAACTTCAGTAGGATTCCCCAAAGACGAGGACAGCTCTTCTGTAAGGGAGGGACCTGGAT 
TTCAGTGTCCTAGAGAACGAAA 

TAGCTCAGAGAATCTAGGTCAACGTGAAATCTAGGTCACAGCGGGCAAAAATGACTGAACGCCTCTA 

TTCCAGGTGAAC GGTCACGT GCCTCAGATATACTGAGGTATTGGGCTCCCACCGGATAAGATTCTGT 

TAGTGAGTCTGCTTTTATTTTGCAGCACATCAGTGGTGACGACCAGAACATCCAGAAGAATGTCAGA 

AGGCTGAAGGAGACAGTGAAAAAGGTACTATTGGCAAGCCACAATACTAAGCCATTCAGTAGGAGA 
C GTG G G G ATTTCTTTCTCTGCTTCCCAGTCTCTTCTA 

CTG GTCC ATTACTC ACTTAG CTG CACCTGCATCTAGCTGGGTCTATAGATCTTTCAATCTGTGTCTAA 
ATTTGTAAGTC AC AATTCTGG AG CTAG CAG AAAG CTTAG CTCAG CCAG TCTCATG AG CACTTG CTCG 
G AGG ATG G CTTGTG AC AG AGTCAATGCTAG AAG ACAG C ATCCCTG ATTCCCAG CTCTGCACTTG CCT 
AGTG G C C ACGTG TAATTACTTTAG CCTG ATTAAG TATTTGGGAAAG CCAATTCCCACCGACCTACATA 
ATCCG AAG AAGC ATG CATTGAAAACTAG AAAG CTGGG C ACAAACTTACTAG AG ATG ATTTTTG AGCT 
CATTAAACTG ATG CTCTGAAATGTG ATCAAATCAACCC AG AATAACAACAAAAG AGCTG G ATTTG CAA 
ATAGGACAAGTATTTAGAATCACTGljGTATTAACAGCTGTCATCTTAATrAAAATATAG 

CTG CCTATTTAAG ATTAAAC ACAAG AGTGG ATAACTTCCCAATTTACTGG G CCTG GTTTC AATAG AGT 
AAAAATATCAGTCATAGATTAATTATAGTGTCATGAAAGTATGAGTTGGAAA 

ACCTTCATTTCTTAGTTATTA rTTTTTTTl CTTCACACCCTGATCAAGCCACTAGTAAGCACCTATCTG 
CTGCGA 

GCTATTATATG ACTTTACAG C AAACAAC ATTGCTGTGTGGCCTCTTTG G G G AAG G G AAC AGG ATAGC 
AG GAG G CTCAGG C TAGC AAGTCTG G ACTCAACCTAAAG CCAG AGG CATG GTTGATAG CAG AG AAAG 
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TGAGGCTCTTCACAAGTGGGTGTGCTTAAGTAATCAGAAACAGGAAGGCTCTGGTTGATrr aattat 

CAGTAAGATATCTACCCTTATCTCCTTCTTCTATAGAAGCTAAACCGTCTCTCC^CTTGTGT^^^ 

CTGATAAACACGCTTGTTTTCTTTTGAGTGTTCATGGCTTTGCAGATTTTCAGTGCTCT 

GTTAGAGGGTTTGTTACCTTGACACCTGGGCTTGGATGTTAGCATGCCAAAGGCACACAC^ 
TGCCTGTGTAAAAGGmTTATTCATTTACTITGT^ 

TCACAGGAGATGTATTCTCTGTAGGAAAACTTTTTmCCCCrTAAAAGCCTATAA^ 
AACrTTGACTTTTATACCATGCTGTCACATGAAAGAGTGmAGGCCC 

aagcaccaataggggaagaaatgttatgccgagaaatctgactggcagggaa/^tgggtS^g^gct' 

CCCCAAAGACCACTACAGGTGTTAAGTAGGAACAGTCGAGGGTGGGTTCAWAATAGAA^^^^ 
GAGGGAGGGAAGATAAGCTACAAAGTTTCATAGGGTCCTAAGTCTTTAAGA^C^^I^^A^CT^rT^ 
GGGCTTCATAACAAAGGMGTCTGGGAAGGCAGCMGCATTGAGAGGGAGATGG^G^rl^YA 
CAATGTAGAGGATTTGAAAAGCTACAAATCCTCCACGAGAGGArrTTTCTTGGAGG^CTAGt^ 
CATTGAGGTTCA^3A ^ GTG G ATCG C AG AAGG ACTTG CTTTGCC ATTTG AATCTGG (^TTTTGTCTCTC 



GCGTCACCCTTTTTTACCCTGGATAGGAGGAGGAAAGAAGGGGTGTTTTGACTCCTACCTroAr 

TACTAGTTTACGCAATGGAACAGACACTCGGGACCTCCTCTTGACAAGAAAAa!^^ 

AACCTGTTGTTTCTCTTGTTTGTTCTTTTGTTAAGAAAGCACAGGCAGCTG^^ 

CCTTTAATCCCAGCArrTGGGAGGCAGAGGCAGGTGACmCTAAATTCAAGGCCAGCCTGGT^ 

AAAGTGAGTTCCAGGACAGCCAGGGCTATACAGAGAMCCCTGTCTCGGGAAAAA^^ 

AAAAGAAAAGAAAAGMGAGMGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGA^GAGA 

GGAGAGGAGAGGAGAGGAGAGGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAArAr 

AAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAAAGAAAAr 

AGAAAAGAAAAGAAAAAAGCAAGCAAGCAAGCACTGGCAAAGCATGCCCACATGGGACGTATr^rv' 

GTCTTTGAGACAAGGCTTTTGAATTGAGCGCTCATCAATAGTTGATCATGGTCAGGTGGAGGGC'^r 

CTGTCAGGCCGAGCCCTGCTGGCTTAGCACTTAACATCTCCAGGTCTCAGTATCACrrcCTrrTrrT 

TAGCACAGTrAGGAGTTGAGCAAACCTTTTTTrCCAACCCCCACTAAAA^ 

TGTAATTTGTGGGATACAGTGTGATAATTGATCTATGTGTGCATTGTGCAAGGTTCAATA^G^r^ 
CAATAGGCCCATCAACAGCTTTATGGGTGTGAAATGCAAGTAATATAGGTAGATGCCTGTG^'TCCT^ 

taggtcagaaaggcatgattttaaggtcttgggcamtcatattata^ 

GTTGATTATCAATCTTTTAGAGAA >vw\a l fc>L,ATTAT 

GGCTGATACTTGGTTTTGGTGCTCAGCAAGCAAATGTCACCAGCTCTTTCTAACTAGTACCACTTTAG 

AAAATGCTACCCGTGCTCAAATTGGTTTGTATTCTTATTTTCATAGCTTGGAGAGAGCGGAGAGATCA 

AAGCGATCGGGGAACTGGACCTGCTGTTTATGTCTCTGAGAAATGCTTGCGTCTGAGCGAG^GAA 

GCTAGAAAACGAAGAACTGCTCCTTCCTGCCTTCTAAAAAGAACAATAAGATCCCTGAATGGACTrTT 

TrACTAAAGGAAAGTGAGAAGCTAACGTCCACCATCATTAGAAGATTTCACATGAAACCTGGCTCAG 

TTGAAAGAGAAAATAGTGTCAAGTTGTCCATGAGACCAGAGGTAGACTTGATAACCACAAAGATTCA 

TTGACAATATTTTATTGTCATTGATAATGCAACAGAAAAAGTATGTACTTTAAAAAATTGTTTGAAAGG 

AGGTTACCTCTCATTCCTCTAGAAGAAAAGCCTATGTAACTTCATTTCCATAACCAATACTTTATV\TAT 

GTAAGTTTArrTATTATAAGTATACATrrTATTTATGTCAGTTTATTAATATGGATTrAmATAGAAA^ 

TTATCTGATGTTGATATTTGAGTATAAAGCAAATAATATTTATGATAATAACTATAGAAACAAGATATCT 

TAGGCTTTAATAAACACATGAATATCATAAATCTTCTGTCTTGTAATTTTTCTCCCTTTAATATCAACAA 

TACCATCATCGTCATCATTACCCAATCATTCTCATGACTTCATGCTTGACTCATATTATCTGGTAAAGT 
TTG 



exon 1: nucleotides 355-584 
exon2: 976-1041 
exon 3: 1141-1284 
exon 4: 2204-2269 
exon 5: 5225-5824 
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